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Abstract 

The maximum-entropy inference assigns to the mean values with respect 
to a fixed set of observables the unique density matrix, which is consistent 
with the mean values and which maximizes the von Neumann entropy. 

A discontinuity was recently found in this inference method for three-level 
quantum systems. For arbitrary finite-level quantum systems, we show that 
these discontinuities are no artefacts. While lying on the boundary of the 
set of mean values, they influence the inference of nearby mean values. We 
completely characterize the discontinuities by an openness condition on the 
linear map that assigns mean values. An example suggests that the openness 
condition is independent of the inference and can be formulated in terms of 
the convex geometry of the set of density matrices — this is left as an open 
problem. 

1 Introduction 

We consider the n-dimensional Euclidean vector space E n with inner product (•, •) 
and norm ||x|| := a/ (x, x). Unless otherwise specified, a subset X C E n is endowed 
with the relative topology induced from E n . Another issue is the topology of the 
affine hull of X, which defines relative interiors and relative boundaries in convex 
geometry. When topologies of affine hulls are used, this will be explicitly specified. 

Let X, Y C E n . A mapping it : Y — > X is open 2 at y G Y , if for each neigh- 
borhood 3 U(y) there is a neighborhood V(ir(y)) included in n{U{y)). The map 7T 
is open if it is open at each y G Y, or equivalently if every open set V C Y has an 
open image tt(V) in X. 

Definition 1. We consider a convex body S C E ra (compact, convex subset) and 
refer to it as state space and to its elements as states. 

This is motivated by the state space S = {p G A \ p >z 0,tr(p) = 1} of a 
real self-adjoint subalgebra A of Mat (AT, C), called *-subalgebra in [Ko], which is 
a convex body. It contains all density matrices, i.e. positive semi-definite matrices 
of trace one. Let t N resp. 0^ denote the identity resp. zero in Mat(A, C). The 
real vector space of self-adjoint matrices of A contains S and is a Euclidean vector 
space endowed with the Hilbert- Schmidt scalar product (a, b) = tr(afo) for self-adjoint 
a, b G A. 
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2 This definition differs from [Pa] where it : Y — >• X is called open at x € X if it is open at every 
y E 7t _1 (.t). The present definition suits our main results. 

3 A neighborhood of y is an open set containing y. 
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Example 1. We consider the *-subalgebra A of Mat(3, C) defined as the real linear 
span of the 3 x 3-matrices (embedded as block-diagonal matrices) 

1 2 ©0, (7i ©0, (7 2 ©0, i(7 3 ©0, 2 ©1 

with the Pauli a-matrices 

*i :=(?&), °2:=(U l )> *s:=(&-°i)- 
The state space is a three-dimensional cone (conv denoting convex hull) 

S = conv ({p(a) | a G [0, 2tt)} U {0 2 © 1}) 

based on the circle p(a) = |(1 2 + sin(a)ai + cos(a;)o" 2 ) © and with apex 2 © 1. 
The circle is an equator of the state space of Mat (2, C), known as Block ball. 

Definition 2. The second element we introduce is a linear subspace [/ CE". We 
denote the orthogonal projection onto U by 

ttu : E n -> U 

and the projection of the state space by 

M := ttu(S). 

The fiber (^Is) -1 (m) = {p E S \ itu{p) = m} of m G M gives rise to the multi- 
valued map 

We call M the mean value set and its elements mean values. 

The projection is motivated by the real vector space of self-adjoint matrices U of 
an operator system [Pn], i.e. a complex self-adjoint subspace of Mat(A r , C) containing 
In. If {ui, . . . , u k } is a spanning set of U, then providing the projection nu(p) of a 
density matrix p G Mat(iVC) is equivalent to providing the mean values 

(u l7 p),. . . ,(u k ,p). 

In Jaynes's maximum-entropy principle [Ja, Wi, St] mean values are commonly 
used and the self-adjoint matrices {u\, . . . , u k } are sometimes called observables. 
For simplicity and in agreement with [Kn] we use the projection Tiu in this article. 

Definition 3. The third and last element to be introduced is a function 

: S ->• R, 

which we refer to as dis orderliness measure. The fibers {F(m)} m£ M of 7Tu\s are 
selective (with respect to 0) if for each mean value m G M there is a unique p m G 
F{m), such that <f>(p m ) = max{0(p) | p G F(m)}, see [Be]. If the fibers are selective, 
then the function 

$ : M ->■ <S, $(m) = argmax{0(p) | p G F(m)} 
is defined. We call $ the inference. 
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Figure 1: a) A conic state space. The neighborhood U(c), defined in the text, lies 
in the drawing above an elliptic section of the cone (dashed), b) The elliptic mean 
value set is the projection of the state space onto the linear space U. The projection 
m = 7T[/(c) is a limit of mean values outside of TiuiU^c)) so irjj\s is n °t open at c. 



We call disorderliness measure in analogy to the "uncertainty" [Ja] or "disorder- 
liness" [Wi] that the von Neumann entropy — trplog(p) assigns to a density matrix p. 
The von Neumann entropy is strictly concave (hence the fibers of nu\s are selective) 
and continuous [We]. 

Our main result is stated in Theorem 2: 

Theorem. If <p : S — >■ R is a continuous disorderliness measure and the fibers of 
nu\s are selective, then the inference $ : M — > S is continuous at m G M if and 
only if the projection ttu\s '■ S ^ M is open at $(m). 

The "if" part of the theorem follows from an argument by Wichmann [Wi] (re- 
formulated in Theorem 1). The "only if" part is trivial and shall be proved now. 

Proof: We give an indirect proof. If nu\s is n °t open at $(m) for some m G 
M, then there exists a neighborhood £/($(m)) in S and a sequence m 8 C M \ 

7r[/(C/($(m))) such that m; m. Since $(m) G ?7(<3>(m)), the sequence $(mj) 
can not converge to $(m), thus $ is not continuous at m. □ 



2 Discussion and open questions 

Physically the openness of ttu\s means that any small ambiguity of the mean value 
can be realized by a small adjustment of the state. In particular, states of maximal 
disorderliness are non-local if 7Tu\s is not open at a state of maximal disorderliness. 

A similar physical meaning is attached to the openness of the barycenter map in 
the description of statistical ensembles of quantum states [Sh2]. This has its origin 
in the idea of stability [Pa] of a convex body S, meaning that the mid-point map 
S x S — ?■ S, (x,y) !->■ \{x + y) is open. We are not aware of any literature on the 
openness of linear projections, except [Ve]. 

We discuss a discontinuous maximum-entropy inference in an example [Kn]. 
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Example 2. We use the state space S from Example 1, a three-dimensional (trun- 
cated) cone. The linear space U is the real span of o\ ©0 and of (02© 1) — 13/3. The 
generating line [p(0), O2 © 1] of the cone S is orthogonal to U, hence m := 7T[/(p(0)) 
has the fiber F(m) = [p(0),0 2 © 1]. The states p(0) and 2 © 1 being mutually 
orthogonal rank-one projection operators, the von Neumann entropy is maximized 
on F(m) at the centroid c := |(p(0) + 2 © 1), see Figure la. 

It is easy to see that the projection iiu\s is not open at $(m) = c. Using 
v z = 2 © 1 — p(0) we consider the neighborhood C/(c) := {p G 5 | (v z ,p) > — §}• 
The projection ttu(U(c)) contains no neighborhood of m, see Figure lb. Hence ttu\s 
is not open at $(m) and the theorem in the introduction shows that $(m) is not 
continuous at m. The inference $(m) for other points m on the boundary of the 
mean value set yields a state on the base circle of the cone S, showing discontinuity 



We might hope that the discontinuity of the inference $ in Example 2 could 
be artificial as it lies on the boundary of the mean value set — at the best, it could 
be unrelated to the inference $| r iM for generic mean values. Here riM denotes the 
relative interior of the convex set M, i.e. the interior with respect to the affine hull 
of M. However, as is proved in Theorem 2d in [Wi] this is not the case because 



We will give in Corollary 1.1 a proof of this inclusion for continuous disorderliness 
measures and selective fibers of the projection. 

The meaning of (1) is that $| r iM can not have a continuous extension to riMU{m} 
if <3> is discontinuous at m G M. This is an analytical pathology because $| r iM itself 
is continuous, as we shall prove in Theorem 3. For the maximum-entropy inference, 
where $| r iM is real analytic and $(riM) is known as Gibbsian family, a discontinuity 
of $ at m G M precludes a continuous extension of that analytic function. The 
Gibbsian family corresponding to Example 2 is studied in detail in [Kn] . 

Low convergence rates may arise in empirical inference problems for mean values 
near to a discontinuity, see [Kn] for a short discussion. Hence, the discontinuities 
should be taken into account in applications which are sensitive to convergence 
rates, like e.g. Bayesian model selection [Ra]. The latter is an example to use the 
maximum-entropy inference in order to test the validity of a physical law (see the 
introduction in [Ja]), a scientist should look for new laws if a given law has insufficient 
inference properties for the data provided by nature. This article shows that it 
may also be worth checking if not the inference itself has insufficient convergence 
properties due to discontinuities. 

Theoretically, a classification of linear spaces U with discontinuous maximum- 
entropy inference is desirable. 

1. Example 2 suggests that the openness condition for discontinuities might be in- 
dependent of the disorderliness measure, given that the disorderliness measure 
assumes its maximum in the relative interior of each fiber, like the von Neumann 
entropy does. In that example, the projection irjj\s is n °t open at any point of 
the fiber [p(0),0 2 © 1], the exception p(0) lies on the relative boundary of the 
segment. 



of $. 
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Question. Let S be a stable convex body, let m G M = ttu(S) and let 7Tu\s be 
open at a single point p in the relative interior of the fiber F{m). Is then ttu\s 
open at every point of F{m) ? 

If not true for all convex bodies, this question is still interesting for state spaces 
of density matrices. The standard example 

conv [{(x, y, 0) G R 3 | (x - l) 2 + y 2 < 1} U {(0, 0, 1), (0, 0, -1)}] 

shows that a condition like stability has to be assumed. Stability of state spaces 
of density matrices is proved in Lemma 3 in [Shi]. 

2. A convex geometric indicator for discontinuity of the maximum-entropy inference 
was proposed in [Kn]: <3> is discontinuous for linear spaces U where non-exposed 
faces are created at the mean value set M = M(U) when U varies in a Grass- 
mannian manifold of subspaces. From this perspective, it is interesting to have a 
classification of non-exposed faces of mean value sets. Some references and ideas 
are collected in [Ws], others may be found in the recent literature on convex 
algebraic geometry. 

3 Upper semi-continuity 

Discontinuities of the inference have some structure. Under the assumption of an 
upper semi-continuous disorderliness measure, we show that the maximal disorder- 
liness (in each fiber) depends upper semi-continuously on mean values. We include 
this statement for completeness, it should be clear for experts in the field of multi- 
valued maps. 

Let X,Y C E™ be subsets. A multi-valued map 

T : X -> Y 

assigns to each element x G X a subset T(x) C Y. If V is such that the set T(x) 
always consists of a unique element, then T is called single-valued map. 

The multi-valued map r : X — > Y is upper semi- continuous at x G X if for each 
open set G containing T{xq) there exists a neighborhood U(xq) such that x G U(xq) 
implies T(x) C G. The multi-valued map r : X — > Y is upper semi-continuous, 
denoted by u.s.c. if it is upper semi-continuous at each point of X and if, also, 
T(x) is a compact set for each x. A different concept is used for single- valued maps 
/ : X — > R. The map / is upper semi- continuous at Xo G X if for each e > there 
exists a neighborhood U(x ) such that x G U(x ) implies f(x) < f(x ) + e. The 
multi- valued map r : X — > Y is closed if for all Xq G X and y G Y , yo ^ F(xq) there 
exist neighborhoods U(x ) and V(y ) such that x G U(x ) implies T(x) nV(y ) = 0. 

The multi-valued mapping F : M — > S is closed. For a proof we consider for 
X C E n , x G E n and e > the open ball 

B?(x):={ueX\ \\y-x\\<e}. (2) 

For m G M and p G S with p F{m) we can take the positive number e : = 
\\\m — 7T[/(p)|| and the neighborhoods B^{m) and Bf(p). 
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In the Corollary to Theorem 7 in Section VI. 1 of [Be] it is proved for compact Y 
that a closed multi- valued map X — > Y is u.s.c, this shows 

F is u.s.c. (3) 

Since the fibers F(m) for m G M are compact and is upper semi-continuous, the 
maximum max{0(p) | p G F(m)} exists for all m G M (see e.g. Theorem 2 in Section 
IV.8 of [Be]). 

Lemma 1. If (p is an upper semi- continuous dis orderliness measure, then the max- 
imum m h-> max{0(p) | p G F{m)} is upper semi- continuous on M. 

Proof: This follows from Theorem 2 in Section VI. 3 of [Be]. We meet the as- 
sumption that 4> : M x S — > R, 0(m, p) := 0(p), is upper semi-continuous on M x S 
and that F is u.s.c, which we have seen in (3). □ 

Upper-semicontinuity of the maximum is illustrated in Example 2. 

Example 3. The maximal-entropy inference $(7T[/(p(q;))) = p(a) is the rank one 
state p(a) for angles a ^ 0, having zero von Neumann entropy. The inference 
<&(iru(p(0))) = c = ^(p(0) + 2 © 1) has von Neumann entropy log(2). So the maxi- 
mum is upper semi-continuous on the elliptical boundary of M but not continuous. 
It is upper semi-continuous on the whole mean value set according to Lemma 1. 



4 All discontinuities 

We characterize all discontinuities of the inference in terms of open projections of the 
state space. This will be done using Theorem 1, which is equivalent to an argument 
in Theorem 2d in [Wi] — except that we have dropped the unnecessary condition of 
(^j)ieN C riM. Wichmanns original theorem is given as Corollary 1.1. 

Theorem 1. Let (j) be a continuous dis orderliness measure and let the fibers of 
nu\s be selective. Let mi C M be a sequence converging to some m G M and let 
Pi G F{mi) for all i G N. If p { $(m) ; then $(mj) $(m). 

Proof: We assume $(m) = lim^ooPj. Since p { G F(nii) we have 0(p«) < 
0($(mi)) for all i G N. As S is compact, a subsequence of $(mj), which we de- 
note $(mj) in abuse of notation, must converge. Let lim^oo $(rrij) =: p. As is 
continuous on S, we have 

0($( m )) = lim (f)(pi) < lim 0($(m;)) = 0(p). 

i— >oo i— >oo 

Since the fibers of 7Tu\s are selective and since 

Tiui^ijn)) — m — lim Wj = lim 7T(/($(mj)) = nu(p) 

i— >oo i— ¥oo 

this implies p = $(m). The converging subsequence of $(mj) was arbitrary, so 
$(mj) converges to <3>(m). □ 

We denote the closure of a subset X C E n in the norm topology of E n by X. 
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Corollary 1.1. Let <p be a continuous disorderliness measure and let the fibers of 
■Ku\s be selective. Then $(M) C $(riM). 

Proof: Let m G M and let p { C riS be a sequence converging to $(m) G S. Then 
the mean values rrii := nu{Pi) he in the relative interior of M for all i G N, because 
the relative interior of a convex set CcE™ maps to the relative interior of the image 
set of C under a linear map (see e.g. Theorem 6.6 in Rockafellar). Theorem 1 shows 

<3>(mj) $(m). □ 



Theorem 2. Let <p be a continuous disorderliness measure and let the fibers of iiu\s 
be selective. For any m G M the inference $ is continuous at m if and only if 7Tu\s 
is open at $(m). 

Proof: The "only if" part was proved in the introduction, we assume in this 
section that $ is not continuous at m G M. Then, there is a sequence m, C M 
converging to m such that its inference values $(mj) do not converge to <£>(m), 

lim $(m,) 7^ $(m). 

It follows from Theorem 1 that $(m) is not the limit of any sequence pi C S that 
has mean values nu{Pi) = ™i for all % G N. Hence there exists e > and K G N 
such that 

inf min ||$(m) — p|| > e. 
This proves that the neighborhood of $(m) 

V := {p G 5 | ||$(m) -p|| < e} 

does not intersect any of the fibers F(mj) for i > K. So ^ ^{/(V) for all i > K. 

As m, m this proves that 7r[/-(V) contains no neighborhood of m = 7T[/($(m)), 
so 7iu\ s is not open at $(m). □ 



5 Continuity for generic mean values 

This section studies a subset — the relative interior — of the mean value set, where the 
inference is always continuous. A special case is the real analytic maximum-entropy 
inference, where the relative interior of the mean value set is generic in the sense 
that it equals the set of mean values of all invertible density matrices [Wi]. 

The domain of U is the inverse projection of the relative interior of the mean 
value set riM 

dom(f/) := F(riM) = (riM + U ± )nS. 

Continuity of the inference on riM translates through Theorem 2 into openness of 
the projection nu\dom(u), which we shall prove. 
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Remark 1. This section could be head-lined "lower semi-continuity" in analogy to 
Section 3. The u.s.c. property of the fiber function -F| r ;M has an l.s.c. counterpart, 
which is equivalent to the openness of the projection iru\dom(u) on the domain of 
U, see Section VI. 1 in [Be]. It seems we can not draw advantage from multi- valued 
maps in this section. 

We make a transformation such that dom(£7) has non-empty interior in the 
topology of E n . 

Lemma 2. Let A C E n be a non-empty affine space with translation vector space 
V . Let U C E n be a linear subspace and set W := TTv(U). Let a G A and denote 
T(x) := x + a — n w (a) the translation of x G E n bya — ir w (a). Then we have for 
all a G A 

iru(a) = iiu o T o Tx w (a) 
and the restriction t^u\t{w) '■ T(W) — > ^(A) is an affine isomorphism. 

Proof: Let a G A. We show that ttu o T o n w (a) = rcu{a) holds for all a G A. Let 
u G U and a G A. Then, since iiu, 7Ty, n w are self-adjoint, we get 

(u, njj o T o ix w (a) — a) — (u, T o ir w (a) — a) = (u, ir w (a) + a — ir w (a) — a) 
= (nv(u), 7iw(a — a) — (a — a)) = (ttv(u), a — a — (a — a)) = 0. 

On the other hand, since tiu\a '■ A — > irjj(A) is trivially onto, the dimension equality 

dimT(Vy) = dimVT = dimn v (U) = dimvr C /(V) = dim7r c/ (A) 

proves that ku\t(w) '■ T(W) —¥ vr^(A) is an affine isomorphism. □ 

We shall use the neighborhood B €u€2 (x) := B^ i (n u (x)) +B^ 2 (tt u ±(x)) of x G E n 
in the topology of E n , for e\ > 0, e 2 > and balls defined in (2). 

Lemma 3. The projection 7Tu\dom.(u) '■ dom(C/) — > riM open. 

Proof: We can assume that the state space 5cE n has full dimension n. This 
follows by using in Lemma 2 the affine hull A of S and applying the two homeomor- 
phisms, T : W -> and 7iv | T (w) : T iW) ->■ VT[/(A). In the topology of E n , the 

domain and the state space have the same non-empty interior, which we denote by 
dom(C/)°. Let p G dom(C/). If the convex set 

G(p) := (p + C/ ± )ndom(C/)° 

is empty, then by a separation theorem, see Theorem 11.3 in [Ro], there exists a 
hyperplane H D (p + f/ -1 ) which does not intersect dom(C/)°. This shows that TTjj(p) 
lies on the relative boundary of riM. The relative boundary of a relatively open set 
being empty, this is a contradiction. 

We can assume that G(p) is non-empty. Let U (p) be a neighborhood of p, then 
there exist ei, e 2 > such that the neighborhood of p 

V(p) -dom^ni^p) 
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is included in U(p). As G a non-empty, there is a state a G G C\V(p). Indeed, as 
p G dom([7), for any state a G G the state (1 — A)? + Ap belongs to dom(C/)° for 
any < A < 1 (see Theorem 6.1 in [Ro]). All of these states lie in G(p) and for A 
near one they lie in V(p). 

As a lies in dom(C/)° and in V(/o), there exist ?i,? 2 > such that 

Let us choose ?i < ei and ? 2 < ti- Then the neighborhood of p 

W(p) := (dom(E0 n B^M) U 

is included V(/o). We have 7r{/(W(/o)) = B~ i (n u (p)). So 7Ti/(W(/o)) is a neighborhood 
of 7T[/(p). This neighborhood is by construction included in iru(U(p)) so 7r[/|dom(c/) 
is an open map. □ 

Theorem 3. Let <fr be a continuous dis orderliness measure and let the fibers of 
iru\s be selective. Then the restriction $| r iM : riM — > dom(C/) 0/ the inference is 
continuous. 

Proof: This follows from Theorem 2 and Lemma 3. □ 
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